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taining speech recognition workstations. Human transcrip- 
tion agents may assist in the initial conversion to text data, 
and human editorial agents may further review the audio and 
textual streams contemporaneously, to make corrections, 
add highlights, identify foreign phrases and otherwise 
increase the quality of the transcription service. Subscribers 
to the service may access a Website or other portal to view 
the media and text in a real time or near-real time to the 
original event, and access archival versions of other events 
for research, editing and other purposes. The subscribers 
may configure their accounts to deliver the streaming con- 
tent in different ways, including full content delivery and 
background execution which triggers on key words for pop 
up text, audio, video or other delivery of important portions 
in real time. The subscribers may set up their accounts to 
stream different events at different dates and times, using 
different keywords and other settings. 
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SYSTEM AND METHOD FOR INTEGRATED 
DELIVERY OF MEDIA AND 
SYNCHRONIZED TRANSCRIPTION 

HELD OF THE INVENTION 

The invention relates to the field of communications, and 
more particularly to the delivery of audio and other media 
broadcasts combined with high-accuracy, synchronous tex- 
tual streams reflecting the dialogue in that media. 

BACKGROUND OF THE INVENTION 

The robust growth in demand for both media content and 
delivery channels has increased the need for novel types of 
information, news, financial and other services. The Internet 
and other network technologies have enabled a variety of 
multipoint media streams, such as news Websites containing 
streamable video clips, audio clips and other media combi- 
nations. One frequent type of news source is a collective 
meeting or proceeding, in which one or a few speakers 
discuss information of interest to a wide audience. Those 
types of settings include sessions of Congress, presidential 
and other news conferences, corporate analysts' meetings, 
media conferences and other group events. 

In the case of sessions of Congress and other governmen- 
tal bodies, the timely delivery of the information content is 
particularly valuable. Many interested parties could benefit 
from prompt knowledge of pending provisions in 
legislation, rulings in court cases and other deliberations. 
For instance, individuals or organizations that would be 
affected by the enactment of pending legislation may want 
to furnish input to their representatives. Or constituents may 
want to take other actions to contribute or adjust to new 
statutory, regulatory or other programs. 

The federal government deploys a host of communica- 
tions facilities situated at a variety of sources, often issuing 
permits for access to those resources. For instance, the U.S. 
Congress permits press access to its chambers and hearing 
rooms, from which live video and audio feeds are generated 
for delivery to commercial networks, news and other orga- 
nizations. 

However, in the instance of legislative reporting, there is 
a particular demand for written records of the legislature's 
activities. Public and private organizations exist which take 
down and transcribe the activities of both chambers. Those 
Congressional transcripts are typically made available in 
hard copy or electronic format within about 48 hours from 
the time of the legislative sessions, for a subscription fee. 
This is in contrast to audio or visual feeds for network TV 
or other delivery, which are often contemporaneous with the 
debates and other activities. The media, the public, interest 
groups as well as the government bodies themselves would 
benefit from more timely and robust delivery of both live 
media and concurrent textual streams of the dialogue. 

SUMMARY OF THE INVENTION 

The invention relates to a system and method for the 
integrated delivery of media and synchronized transcription, 
in which a dedicated network collects, processes and deliv- 
ers unified audio, video and textual content on a live basis 
to subscribers. In one regard, the invention may incorporate 
front-end audio or video servers which sense and collect the 
audible or video activities of a legislature, press conference, 
town meeting or other event. 

The raw, digitized media feeds from the event are trans- 
mitted to a centralized distribution server, which in turn 
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delivers the digitized stream of the event to a remote 
transcription facility, where automated and human transcrip- 
tion stages decode the dialogue taking place. After speech 
recognition and editing take place, the textual content is 

5 synchronized with the original audio, video or other media 
and delivered to subscribers, for instance via a Web site 
interface. Subscribers may configure the delivery modes 
according to their preference, for instance to silently parse 
the textual steam for key words, triggering full -screen, 

10 audible, wireless or other delivery of the audio or video 
content when a topic of interest is discussed. 

The subscribers may alternatively choose to view and 
hear the media and textual output continuously, and may 
access archives for the purpose of reproducing text for 

35 research or editorial activities. 

BRIEF DESCRIPTION OF THE DRAWINGS 

The invention will be described with respect to the 
accompanying drawings, in which like elements are refer- 
20 enced by like numbers, and in which: 

FIG. 1 illustrates an overall network architecture for 
delivery of media and text according to one embodiment of 
the invention. 

FIG. 2 illustrates an example of a subscriber interface 
used to view the output produced by the invention. 

FIGS. 3 and 4 illustrate a flowchart of media and textual 
processing according to the invention. 

DETAILED DESCRIPTION OF PREFERRED 
30 EMBODIMENTS 

The invention will be described with reference to an 
illustrative embodiment whose overall architecture is shown 
in FIG. 1. In this embodiment, a Congressional session or 

35 other event is intended to be recorded and delivered to public 
and other subscribers with a simultaneous textual stream. In 
the illustrative embodiment, a microphone array 102 is 
installed in a Congressional chamber, auditorium or other 
even! site. The microphone array 102 is connected to an 

An audio server 104 located at the event site. 

40 

The audio server 104 may be or include a computer 
workstation having one or more high-resolution audio digi- 
tizer boards along with sufficient CPU, memory and other 
resources to capture raw sounds and other data for process- 
45 ing in digital form. In one embodiment, the audio server 104 
may use as an encoding platform the commercially available 
RealProducer™ software to produce a digitized audio 
stream. 

In the embodiment illustrated in FIG. 1, after capture of 

so the spoken words of the event, the resulting raw, digitized 
audio stream is transmitted over communications link 106 to 
a remote distribution server 108 acting as a distribution and 
processing hub. Communications link 106 joining the audio 
server 104 and the distribution server 108 may be or include 

55 any one or more of, for instance, the Internet, an intranet, a 
LAN (Local Area Network), a WAN (Wide Area Network) 
or a MAN (Metropolitan Area Network), a frame relay 
connection, an Advanced Intelligent Network (AIN) 
connection, a synchronous optical network (SONET) 

60 connection, a digital Tl, T3 or El line, Digital Data Service 
(DDS) connection, DSL (Digital Subscriber Line) 
connection, an Ethernet connection, an ATM (Asynchronous 
Transfer Mode) connection, FDDI (Fiber Distributed Data 
Interface) or CDDI (Copper Distributed Data Interface) 

65 connections. 

Communications link 106 may furthermore be or include 
any one or more of a WAP (Wireless Application Protocol) 
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link, a GPRS (General Packet Radio Service) link, a GSM 
(Global System for Mobile Communication) link, or other 
wired or wireless, digital or analog interfaces or connec- 
tions. 

Distribution server 108 incorporates a database 110 for 5 
the mass storage of synchronized collections of audio, video 
and textual information related to individual media events 
collected by one or more audio server 104 or other front-end 
sources. In one embodiment, such additional sources may 
include a portable text -scanning or OCR device such as the 10 
Hewlett-Packard CapShare™ to capture and transmit textual 
information such as press releases, schedules, transcripts or 
other data from the event site along with other media using 
infrared or other connections to communications link 106. 

Distribution server 108 may be or include, for instance, a 15 
workstation running the Microsoft Windows™ NT™, Unix, 
Linux, Xenix, Solaris™, OS/2™, BeOS™, Mach, Apache, 
OpenStep™ or other operating system or platform software. 
In the illustrative embodiment of FIG. 1, the distribution 

20 

server 108 directs the raw, digitized audio stream via com- 
munications link 112, which may be or include similar 
connections as communications link 106, to a processing 
facility 140. 

Processing facility 140 may be a separate facility or other 2J 
internal, local or remote engine dedicated to the transcription 
of the raw media input into character or other format, such 
as ASCII English or other textual or other forms. Processing 
facility 140 may incorporate a voice recognition server 114 
to receive the digitized audio or other media streams for 3Q 
processing and conversion. The voice recognition server 114 
may in one embodiment include one or more speech recog- 
nition module 146, such as the commercially available 
Dragon™ Professional or IBM Via Voice™ product lines. 

Speech recognition module 146 may preferably be 35 
capable of speaker-independent operation. Different or spe- 
cialized versions of speech recognition module 146 may be 
employed within voice recognition server 114 to enhance 
accuracy, upgrade the functionality, or provide special for- 
eign language or other features according to the transcription 40 
needs. The voice recognition server 114 may be attended by 
a human transcription agent to monitor and operate the 
speech recognition module 146 and other components, to 
ensure the smooth flow of first stage conversion from voice 
to text. In the practice of the invention, it may be advanta- 45 
geous to train the speech recognition module 146 with 
particular vocabulary words likely to be spoken at the event, 
before processing the media stream. 

In another embodiment of the invention, the audio server 
104, speech recognition module 146 and other elements may 50 
cooperate to recognize and split different voices or other 
audible sources into separate channels, which in turn are 
individually processed to output distinct textual streams. 

The voice recognition server 114 thus invokes one or 
more speech recognition module 146 preferably with over- 55 
sight or monitoring by a human transcription agent to 
resolve the digitized verbal content generated by the audio 
server 104 into a raw textual stream, for instance ASCII- 
coded characters. Output in other languages and formats, 
such as 16-bit Unicode output, is also possible. The role of 60 
the transcription agent may include the maintenance and 
operation of the speech recognition module 146, monitoring 
the raw textual stream and other service tasks. The tran- 
scription agent's role however is intended to be compara- 
tively limited, and to generally not or not frequently involve 65 
sem antic judgments or substantive modifications to the raw 
textual stream. It may be noted that the role of or need for 
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the transcription agent may be reduced or eliminated in 
implementations of the invention, depending on the sophis- 
tication and accuracy of the speech recognition module 146, 
as presently known or developed in the future. 

Once the initial conversion from original media is done, 
the raw textual stream may be delivered over local connec- 
tion 118, such as an RS232 serial, Fire Wire™ or USB cable, 
to a scopist workstation 120, which may also be located 
within the processing facility 140 or elsewhere. The scopist 
workstation 120 may incorporate text editing software pre- 
sented on a graphical user interface (GUI) 122 for review by 
a human editorial agent, whose role is intended to involve a 
closer parsing of the raw textual stream. 

The tasks of the editorial agent illustratively stationed at 
scopist workstation 120 include review of the raw textual 
stream produced by the voice recognition server 114 to 
correct mistakes in the output of the speech recognition 
module 146, to resolve subtleties, foreign language phrases, 
to make judgments about grammar and semantics, add 
emphasis or other highlights and generally to increase the 
quality of the output provided by the invention. The editorial 
agent at the scopist workstation 120 may be presented with 
the capability, for instance, on the agent GUI 122 to stop/ 
play/rewind the streaming digitized audio or other media in 
conjunction with the text being converted, to compare the 
audible event to the resulting text. 

In one embodiment of the invention, compression tech- 
nology known in the art may be employed to fast- forward 
the media and textual stream for editing or other actions 
while still listening to audible output at a normal or close to 
normal pitch. 

In the practice of the invention, the editorial agent at 
scopist workstation 120 generally attempts to enhance tex- 
tual accuracy to as close to 100% as possible. The invention 
also outputs the synchronized streams with as little lag time 
from event to reception as is possible, to provide an expe- 
rience akin to a "live" television broadcast for the subscriber. 
However, in any implementation some degree of delay, 
including that resulting from processing time in the servers, 
network lag, and human response time of the transcriber, 
editorial agent or other attendants, is virtually inevitable. 
The total amount of delay from event to reception may vary 
according to the nature of the input, network conditions and 
other factors, but may generally be in the range of 15 
minutes or less. 

After all editorial corrections and enhancements are 
entered at the scopist workstation 120, the edited textual 
stream is delivered via communications link 124, which may 
likewise be or include a similar link to communications link 
106, to a text encoder module 126 incorporated within the 
distribution server 108. The communications link 124 may 
also be or include, for instance, a Telnet connection initiated 
over the Internet or other network links. 

The text encoder 126 receives the corrected textual stream 
and converts the stream into, in an illustrated embodiment, 
a RealText™ stream adhering to the commercially known 
Real standard for further processing. The converted Real- 
Text™ stream may be transmitted via connection 128, which 
may be, for instance, a lOObaseT connection to processor 
142. The finished, edited, corrected, converted RealText™ 
stream representing the audible or other events being tran- 
scribed is then sent to distribution server 108, synchronized 
and stored in database 110 with the raw digitization of the 
media from the event, for delivery to subscribers. 

The synchronization may be implemented, for instance, 
using the WallClock function of the commercially available 
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Real software. The Wall Clock function allows multiple appliance such as a WebTV™ unit, radio-enabled Palm™ 
media streams to be synchronized using internal timestamps Pilot or similar unit, a set-top box, a game-playing console 
encoded into each stream. As the streams are received on the such as Sony Playstation™ or Sega Dreamcast™ , a browser- 
client or recipient side, they are buffered until all streams are equipped cellular telephone, other TCP/IP client or other 
at the same internal time to each other. Once the streams are 5 wireless appliance or other communication device, 
aligned in time using timestamp and other information, the The combined, synchronized media and finished textual 
player within the client workstation 136 may start playing stream arriving over subscriber link 134 from database 110 
the streams simultaneously. The distribution server 108 may may be viewed on a client GUI 144 in conjunction with an 
store the finished composite stream or portions thereof in administrative module 138 running on the client workstation 
database 110 in RealText™ or a variety of other formats, for 3Q 136 permitting authentication of subscribers, access to and 
instance in XML, HTML, ASCII, WAV, AIFF, MPEG, MP3, manipulation of the information content delivered by the 
Windows™ Media or others. invention. More particularly, a subscriber may use the client 

The arrival of an illustrative finished RealText™ or other GUI 144 on client workstation 136 to invoke or log into a 
stream into the database 110 may trigger a start code which Website for their information subscription, and enter pass- 
releases the synchronized media and processed textual word and other information to view the synchronized output 
streams for delivery to subscribers to the service of the 15 stream according to their delivery preference. Schedules of 
invention over dissemination link 130. Dissemination link different types of media events, in searchable database or 
130 may, again, be or include a similar link to communica- other form » ma y m another embodiment be presented on the 
lions link 106, such as a single or multiple digital Tl or other client GUI 144 10 in evem selection, 
communications channel. For ^stance, the subscriber may choose to view the entire 

Dissemination link 130 may furthermore be or include a 20 ^rmation stream produced by the invention, including 

Personal Area Network (PAN) a Familv Area Network audl0 ' V,deo and s y nchromzed texlual 0Ut P ul 00 chenl GUI 

✓r-A*^ * * - v \ , 144 using speakers 148, headphones and other output 

(FAN), a cable modem connection, an analog modem con- devjces f £ ^ rther reyiew 

nection such as a V.90 or other protocol connection, an ^ , . , ., , , 

. , „ ^ i xr i /irnvn i o i_ Conversely, the subscriber may enter commands using 

Integrated Service Digital Network (ISDN) or Digita Sub- 25 admiDistrativ e module 138 and cl [ em GU i 144 l0 have th * 

fw ,b n er ^I n ^ (DSL)C ? nCCUO V B1 ^<ooth wireless link a stream de i iver ed silently or in a background 

WAP (Wireless Application Protocol) link, a Symbian™ process> wkh ^ aleft function activated ^ alert fo^^ 

link, a GPRS (General Packet Radio Service) link, a GSM may xan the i ncoming textual stream at the point of the 

(Global System for Mobile Communication) link, a CDMA distribution server 108 or client workstation 136 for the 

(Code Division Multiple Access) or TDMA (Time Division 3Q presence of key words chosen by a subscriber, upon the 

Multiple Access) link such as a cellular phone channel, a detection of which a full screen may pop up showing the 

GPS (Global Positioning System) link, CDPD (cellular surrounding text, video or other content. Alternatively, upon 

digital packet data), a RIM (Research in Motion, Limited) detection of a keyword the alert function may deliver other 

duplex paging type device, an IEEE 802.11-based radio information such as a message or notice via e-mail, an Inbox 

frequency link, or other wired or wireless links. 35 message in Microsoft Outlook™, an online instant message, 

Dissemination link 130 illustratively includes TCP/IP an IRC (Internet Relay Chat) message or other electronic 

connections over the Internet 132 to one or more subscriber bulletin. 

connection 134, which in turn may be or include links In another embodiment, the user may choose to receive 

similar to communications link 106, for delivery to one or the informational content in a silent mode while viewing the 

more client workstation 136. In one embodiment, any one or ^ entire textual stream, but with the ability to highlight por- 

more of communications link 106, communications link tions of the textual stream to then hear the audio output 

112, communications fink 124, communications link 130, associated with that portion. This, for instance, may be 

communications link 134 or other communications links useful for a subscriber wishing to discern emphasis, inquiry, 

may be or include self-healing or self-adjusting communi- irony or other inflections or subtleties that may not be 

cation sockets that permit dynamic allocation of bandwidth 45 evident in textual form. 

and other resources according to local or global network a subscriber operating the client workstation 136 may 

conditions. likewise choose to highlight, cut, paste, stream to hard or 

The client workstation 136 may be or include, for removable drive or otherwise store or archive one or more 

instance, a personal computer running the Microsoft Win- portions of the information content delivered by the inven- 

dows™ 95, 98, 2000, Millenium™, NT, Windows CE™, 50 tion for later processing, word processing, retransmission or 

Palm™ OS, Unix, Linux, Solaris™, OS/2™, BeOS™, other uses. In another regard, subscriber access via the 

MacOS™ or other operating system or platform. Client subscriber links 134 may permit a Website or other entry 

workstation 136 may also be or include any microprocessor- portal to allow a subscriber to access prior news events for 

based machine such as an Intel x86-based device or archival or research purposes. Likewise, the subscriber may 

Motorola 68K or PowerPC device, microcontroller or other 55 manipulate administrative module 138 to schedule the deliv- 

general or special purpose device operating under pro- ery of the streaming service of the invention according to 

grammed control. specified dates and times, events of interest and associated 

Client workstation 136 may furthermore include elec- delivery modes, and other settings, 
ironic memory such as RAM (random access memory) or In this respect, database 110 with in distribution server 108 

EPROM (electronically programmable read only memory), 60 may be configured to be searchable according to discrete 

storage such as a hard drive, CDROM or rewriteable search terms, particular fields related to header descriptions 

CDROM or other magnetic, optical or other media, and of the event, or on other bases. In this regard the database 

other associated components connected over an electronic 110 may be configured with a decision support or data 

bus (not shown), as will be appreciated by persons skilled in mining engine to facilitate the research functions. An 

the art. 65 example of subscriber choices for manipulating the client 

In the modem pervasive computing environment client GUI 144 and associated administrative choices is illustrated 

workstation 136 may also be or include a network-enabled in FIG. 2. 
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General media and translation processing according to the wherein said distribution server synchronizes said textual 
invention will be described with reference to the flowchart stream with said stored media stream and produces a 
of FIGS. 3 and 4. In step 200, processing begins. In step 202, combined media and textual stream, and 
audio or other input from an event is collected and delivered wherein said distribution server outputs said combined 
to audio server 104. In step 204, the raw audio, video or 5 media and textual stream to a recipient such that the 
other signals are digitized. In step 206, the digitized audio recipient can observe said combined media and textual 
data is transmitted to the distribution server 108. In step 208, stream on a synchronized and simultaneous basis, 
the digitized audio stream, in RealAudio™ format, or 2. The system of claim 1, wherein the at least one 
otherwise, is transmitted to the processing facility 140. In streaming media source comprises an audio server output- 
step 210, speech recognition module 146 is invoked to 10 ti ng digitized audio to the first interface, 
output an ASCII text or other stream corresponding to the 3. The system of claim 1, wherein the distribution server 
audio content. comprises a database for storing the media stream and the 

In step 212, the ASCII text stream is output to the scopist textual stream, and the distribution server synchronizes the 

workstation 120. In step 214, the ASCII text stream is edited textual stream and the media stream for storage in the 

by an editorial agent at the scopist workstation 120 using the 15 database. 

agent GUI 122. In step 216, the edited or corrected textual 4. The system of claim 1, wherein the transcription engine 

stream Ls transmitted to the text encoder module 126. In step comprises a voice recognition server executing a speech 

218, the corrected or edited ASCII text is converted to an recognition module outputting a raw transcription stream, 

advanced text format, such as RealText™. the voice recognition server presenting the raw transcription 

In step 220, the reformatted textual stream is stored and 20 stream to a transcription agent to monitor the generation of 

synchronized with the audio or other media source within the raw transcription stream. 

database 110. The integrated media/textual information is 5. The system of claim 4, wherein the transcription engine 

now prepared for subscriber access. In step 222, one or more comprises a scopist workstation, the scopist workstation 

subscribers access the distribution server 108 and are vali- receiving the raw transcription stream from the voice rec- 

dated for use. In step 224, the subscriber's delivery profile 25 ognition server and presenting the raw transcription stream 

is checked to set delivery mode, such as full streaming to an editorial agent to edit and output as the textual stream, 

content, background execution while searching for alert 6. The system of claim 1, wherein the combined media 

terms, or other formats or modes. and textual stream is output to the recipient at a client 

In step 226, the integrated audio or other media along with workstation, 

the textual stream is delivered according to the subscriber's 30 7 - The s y stem of claim 6 > wherein the combined media 

service profile, whether triggering an alert or other mode. In *nd textual stream * 0Ut P ut via ™ Interaet connection, 

step 228, subscriber requests for archival Unking to related 8 * ^ svstem of claim 7 > wnerein the chenl workstation 

sources or other non-streaming services may be processed as comprises an administrative module, the administrative 

desired. In step 230, processing ends. module managing the delivery of the combined media and 

Hie foregoing description of the system and method for 35 textualstream to the recipient. 

* j T i- c j- j 1 a* - *- 9. The system of claim 8, wherein the administrative 
integrated delivery of media and synchronized transcription . , J . , ' . , 
• i * . • • a *• j • i *\ module comprises deli very configurations, the delivery con- 
is illustrative, and variations in configuration and implera en- ^v W m F « y B , j 

.„ . ni , • . c • , „ figurations comprising at least one of full delivery ot the 

tation will occur to persons skilled in the art. t or instance, , • , > i . . i j i i* c 

... . f 1/4n , . .,, , , n( , • combmed media and textual stream, background delivery of 

while the processing facility 140 has been illustrated in . n , ... . . . , / u j »• /, u 

r • i « . •* u .,. ii nF lUa 40 the combined media and textual stream, scheduling of the 

terms of single remote site handling all ot the streaming ,,. ... , _/i. ji- 

. ■ . j • f 'u * j i_ tU A-* me delivery of the combined media and textual stream, de five ry 

media content distributed by the distribution server 108, f j . <• , j • ,u . i 

... . . . . . . • . of an alert based upon detection of a keyword in the textual 

transcription and other processing services could be distnb- vl an annua** j 

. . . .. r i u • . stream, and delivery of the combmed media and textual 

uted to a variety of locations having different computing, j . r , j • .u . i 

j.l p. t • . stream based upon detection of a keyword in the textual 

communications and other resources. For further instance, 45 dU " aiA1 ua ^ u u f wu " vvl * 

the finishing and synching of the integrated text and media Str ?A m *« c i • ^ u • .u * i 

stream could be executed within the processing facility 140 10 of claim }> whe * m the at least ° ne 

when provisioned with sufficient processing, storage and f^?"* " ed,a «»?Pnses a video server outputtmg 

other resources. Furthermore, an event site could have one <«"e»i»a video to the first interface, 

or more audio server 104 or other front-end capture facility ,„ 11 ?* svstem of claim , wh " ein * e textual stream 

to process source media. Moreover multiple events could be 50 teXtUal 0U l pU , m \ plm * hly . of lM 8"»»»- 

processed at the same time, to generate a combined output 12 ^ of claun wh " em ^ t6Xt " al ?™ m f 

stream. The scope of the invention is accordingly intended ^"f 8 textual ° ut P ut «>it«pondjiig to a plurality of 

to be limited only by the following claims. Sp t, e ^! . c , ■ <i •. • .■. . . i . 

What is claimed is: „ 13 V" sy f en J. ° f C }T U \ wh T° lhe f" 01 

1. A system for generating synchronized media and tex- 55 comprises a plurality of channels, each corresponding to one 

tual streams, the system comprising: 0 .? S ? G e ! s * c . . , . 

. ' 3 , r & « , 14. A method of generating synchronized media and 

a first interface to at least one streaming media source; comprising: 

a distribution server in communication with the first . _ . „r„.i ♦ ^ „ tM • 

.. , a) receiving a media stream from at least one streaming 

interface, the distribution server storing the media 60 j- 

& media source; 

stream; and . . . 

, . r . t . ... . t b) storing the media stream m a distribution server; 

a second interface in communication with the distribution 1 & 

server and a transcription engine, said transcription c ) outputting the media stream to a transcription engine; 

engine receiving the media stream and producing a d) generating a textual stream corresponding to the media 

textual stream corresponding to the media stream and 65 stream in the transcription engine; 

outputting said textual stream to said distribution e) synchronizing the media stream with the textual 

server, stream; 
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f) producing a combined media and textual stream; and 21. The method of claim 20, wherein the combined media 

g) outputting the combined media and textual steam to a textual stream are received at the client workstation, the 
recipient client workstation comprising an administrative module, 

15. The method of claim 14, wherein the at least one forther comprising a step of 0) managing the delivery of the 
streaming media source comprises an audio server output- s combined media and textual stream to the rectpient using the 

, administrative module, 

ting digitized audio. 22 method of ckim 2J wherein ^ Q) of 

16. The method of claim 14, wherein the distribution m comprises a step of using the administrative 
server comprises a database for storing the media stream and modu f e * ^ delivefy configurationS) the delivery configu . 
textual stream and wherein the step (e) of synchronizing the rations comprising at least one of full delivery of the 
media stream and the textual stream is performed by the 10 combined media and textua i stre am, background delivery of 
distribution server for storage in the database. me combined media and textual stream, scheduling of the 

17. The method of claim 14, wherein the transcription delivery of the combined media and textual stream, delivery 
engine comprises a voice recognition server, further com- 0 f an alert based upon detection of a keyword in the textual 
prising a step of (h) executing a speech recognition module stream, and delivery of the combined media and textual 
on the voice recognition server and outputting a raw tran- 35 stream based upon detection of a keyword in the textual 
scription stream, the voice recognition server presenting the stream. 

raw transcription stream to a transcription agent to monitor 23. The method of claim 14, wherein the at least one 

the generation of the raw transcription stream. streaming media source comprises a video server outputting 

18. The method of claim 17, wherein the transcription digitized video to the first interface. 

engine comprises a scopist workstation, further comprising 20 24. The method of claim 14, wherein the textual stream 

a step of (i) receiving the raw transcription stream from the comprises textual output in a plurality of languages, 

voice recognition server in the scopist workstation and 25. The method of claim 14, wherein the textual stream 

presenting the raw transcription stream to an editorial agent comprises textual output corresponding to a plurality of 

to edit and output as the textual stream. speakers. 

19. The method of claim 14, wherein the combined media 25 26. The method of claim 25, wherein the textual stream 
and textual stream is output to the recipient at a client comprises a plurality of channels, each corresponding to one 
workstation. of the speakers. 

20. The method of claim 19, wherein the combined media 

and textual stream is output via an Internet connection. ***** 
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